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(57) The present invention discloses a butterfly 
processor capable of performing convolutional decod- 
ing and LogMAP decoding in telecommunications sys- 
tems. First and second add^compare-select modules 
are provided for receiving input path metrics. A branch 
metric calculator is also provided for receiving input data 
and extrinsic data. The branch metric calculator gener- 
ates output branch metrics to each of the first and sec-, 
ond add-compare-select modules. Each of the add- 
compare-s elect modules includes a log-sum correction 
means coupled to compare and select components. A 
controllable switch selectively couples outputs of the se- 
lect components and the log-sum corrections means to 
enable either one of convolutional or LogMAP decoding. 
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Description 

Technical Field of the invention 

[0001] The present invention relates generally to decoding in a wireless communications system and, in particular, 
to a butterfly processor for decoding in wireless communications systems. 

Background Art 

[0002] Communication systems deal with the transmission of information from a transmitter to a receiver. The trans- 
mission medium through which the information passes often contains many sources of noise, including cosmic radia- 
tion, Additive White Gaussian Noise (AWGN), Rayleigh scattering (multipath propagation) and electromagnetic noise. 
The presence of these noise sources corrupts or prevents the transmission of the desired information, thus limiting the 
ability to communicate. 

[0003] It is well known in the art that coding of the information to be transmitted, through the addition of redundant 
information calculated from the source information, improves the ability to successfully receive the transmitted infor- 
mation. Decoding uses the redundant information to detect the presence of errors or estimate the most probable emitted 
bits, given those received. Errors are detected when the transmitted redundancy is different from that subsequently 
calculated with the received data. 

[0004] The weight of a codeword is a measure of the capacity to recover data from the codeword. A codeword with 
a high number of bits has a high weight. A low weight codeword exhibits a low ability to recover data, whereas, con- 
versely, a high weight codeword exhibits improved recovery of data. 

[0005] Automatic-repeat-request (ARQ) coding schemes employ an error-detection code. If the presence of an error 
is detected in the information received, a message requesting retransmission of the relevant information is sent from 
the receiver to the transmitter. ARQ coding schemes are relatively simple, but require the use of a feedback channel 
and deliver variable and comparatively slow throughput. 

[0006] Forward error correction (FEC) coding schemes are used to encode information in systems in which propa- 
gation delays and latency are of concern. The receiver is able to detect and correct errors, without requiring a feedback 
channel. 

[0007] Coding schemes can be broadly categorised into block codes and convolutions codes. 
[0008] Block codes map a message of k information bits into a structured sequence of n bits, where n>k. The code 
is referred to as a (n,k) code. The ratio (n-k)/k is called the redundancy of the code and the ratio of information bits to 
the total number of bits, k/n, is called the code rate. The extra bits inserted provide redundancy and are used by the 
decoder to provide error detection and correction. The redundant bits added during encoding are only dependent on 
the k information bits in the message block. Block codes are often used to detect errors when ARQ is implemented. 
[0009] Convolutional encoding generates a block of n code bits in a given period of time from k information bits, 
where n and k are typically small. The block of n bits generated by the encoder Ss dependent not only on the k information 
bits of the time period, but also on the message blocks generated during a predefined number of preceding time periods. 
The memory thus imparted on the coding enables errors to be corrected based on allowable sequences of codes. 
Convolutional decoding may be performed using either a Viterbi algorithm or LogMAP algorithm. 
[0010] Convolutional codes are preferred for wireless voice communications systems in which the retransmission of 
data and its associated delay is intolerable. Block codes are capable of delivering higher throughput and are preferred 
for the transmission of data where latency is less of a concern. 

[0011] Turbo codes, also known as parallel concatenated codes, are a class of codes whose performance is very 
close to the Shannon capacity limit. Turbo coders are implemented by connecting convolutional encoders either in 
parallel or series to produce concatenated outputs. Bit sequences passing from one encoder to another are permuted 
by an interleaves In this manner, low-weight code words produced by a single encoder are transformed into high- 
weight code words. Turbo decoding thus takes two low weight codewords and obtains the effect of a much higher 
weight codeword. 

[001 2] At present, consumer wireless communication systems are primarily concerned with the transmission of voice. 
Such wireless communication systems include Advanced Mobile Phone Service (AMPS), Global System for Mobile 
Communication (GSM) and Code Division Multiple Access (CDMA). These represent the first (1G) and second (2G) 
generation systems. With the convergence of data and voice communication systems, the second-and-a-half gener- 
ation (2.5G) and third generation (3G) systems are emerging in which the transmission of data is becoming a more 
Important concern. In order to achieve superior error performance at higher transmission rates, turbo block encoding 
is preferred. The latency endemic to block coding is not as significant an issue as it is with the transmission of voice. 
New, third generation mobile wireless standards, like Universal Mobile Telecommunication Service (UMTS) and 
CDMA2000 require turbo encoding for data streams and convolutional encoding for voice streams. These systems 
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require a complex turbo decoder for data and a Vlterbi decoder for voice. Furthermore, backward compatibility requires 
that second generation standards are also supported. 

[0013] The transmission of voice and data provides conflicting requirements of transmission rate versus latency and 
propagation delay. The current mode of addressing these problems is to provide separate encoding systems: turbo 
encoding for data streams and convoluttonaj encoding for voice streams. Consequently, different decoders are also 
required, resulting in a multiplicity of hardware platforms and thus increased costs for telecommunications operators. 
[0014] Butterfly processors typically consist of a number of Add-Compare-Select units and a Branch Metric Calcu- 
lator Known butterfly processors are capable of performing one of Viterbi or LogMap calculations. Consequently, In 
order to perform Viterbi and LogMap calculations, two distinct types of butterfly processors are required, resulting in 
additional hardware costs. 

Summary of the Invention 

[0015] In accordance with the principles of the present invention, a Butterfly Processor arrangement is provided 
which is capable of performing convolutions and LogMap calculations. The Butterfly Processor arrangement of the 
invention includes a Log-Sum correction device coupled to well known compare and select components. A controllable 
switch for selectively coupling outputs of the select component and the Log-Sum correction device is provided to pro- 
duce a desired output of the Butterfly Processor. Advantageously, the invention provides a reduction in hardware as 
a single Butterfly Processor can perform calculations which currently require two distinct Butterfly Processors. 

Brief Description of the Drawings 

[0016] A number of preferred embodiments of the present invention will now be described with reference to the 
drawings, in which: 

Fig. 1 is a schematic block diagram representation of a communication network employing multiple protocols; 
Fig. 2A is a schematic block diagram representation of a communication system employing coding; 
Fig. 2B is a schematic block diagram representation of a generic Viterbi decoder in a communication system 
employing coding; 

Fig. 2C is a schematic block diagram representation of a generic turbo decoder in a communication system em- 
ploying coding; 

Fig. 3 is a schematic block diagram representation of a unified decoder; 

Fig. 4 is a schematic block diagram representation of an architecture for a unified decoder; 

Fig. 5A is a schematic block diagram representation of a butterfly processor of Fig. 4; 

Fig. 5B is a schematic block diagram representation of an Add-Compare-Select (ACS) unit of Fig. 4; 

Fig. 6A is a representation of a 32-state trellis and its corresponding butterfly processors and path metrics; 

Fig. 6B shows the resultant path metric locations; . . , 

Figs 7A-7E are representations of the in-place path metric addressing at times t=1 to t=5 respectively; 
Fig. 7F is a representation of the addressing of the path metric columns; 

Figs 8A-8F are representations of the in-place path metric addressing of a reverse trellis configuration; 
Fig. 9A is a schematic block diagram representation of an Intermediate Decoding Memory Processor of Fig. 4; 
Fig. 9B is a schematic block diagram representation of an exploded view of Fig. 9A showing a Window Memory 
Subsystem, Traceback Controller and Interleaver Controller, 

Fig. 9C is a schematic block diagram representation of a Traceback Controller of Fig. 9B; 

Fig. 9D is a schematic block diagram representation of an Interleaver of Fig. 9B; 

Fig. 9E is an exploded view of an Interleaver address controller of Fig. 9D; 

Fig. 9F is a schematic block diagram representation of a Window Memory Subsystem of Fig. 9B; 

Fig. 10A is a schematic block diagram representation of a LogLikeiihood processor of Fig. 4 for a single row 

decoder; 

Fig. 1 0B is a schematic block diagram representation of an Add-Compare-Select Node unit of Fig. 10A; 

Fig. 10C is a schematic block diagram representation of a LogLikeiihood processor of Fig. 4 for an eight row 

decoder; 

Fig. 10D is a schematic block diagram representation of an ACS unit of Fig. 10A; 

Fig. 11 is a schematic block diagram representation of a bank of butterfly decoding processors of Fig. 4; 

Fig. 12 is a schematic block diagram representation of a Reverse Address Processor of Fig. 4; 

Fig. 13 is a schematic block diagram representation of Normalisation Subtracters of Fig. 4; 

Fig. 14 is a schematic block diagram representation of a Comparator of Fig. 4; 

Fig. 15 is a schematic biock diagram representation of a Path Metric Memory of Fig. 4; 



10 



15 



45 



50 



EP1 204 211 A1 

Fig. 16 is a schematic block diagram representation of a Forward Address Processor 
Fig. 1 7 is a schematic block diagram representation of a Comparator (ACS level) of Fig 4- 
Fig. 1 6 is a schematic block diagram representation of an Input Symbol History of Fig 4- ' 
Fig. 19 is a schematic block diagram representation of a LogLikelihood Ratio Processor of Fiq 4- 
Figs^OA and 20B illustrate use of murtiple decoders to implement a single Turbo decoder 

Fig. 22 Is a schematic block diagram representation of four interconnected decoders operating together as a sinale 
decoder for even higher performance decoding (32-state trellises every cycle); and t0 S etheras a ™& 

Figs 23A and 23B are schematic block diagram representations of a non-systematic encoder. 

Detailed Description 



E2 ^lll^T "TTT P ? VidSS 3 " nified d6COder arcn itecture tor witless communication systems. The 
un if ed decoder implements the decoding required for convolution^ encoded and turbo encoded data streams Tha 
mo e S der ri architec h tu - can SU PP°« multiple data streams and muttiple voice S ^7t^ZZfyTur2r- 
mo e, the decoder can be dynamically partitioned, as required, to decode voice streams for different standards The 
preferred embodiment is modular and thus readily scalable. curerent standards. The 

20 JLh LlSn T"T* C 7" 1unication "«*"«>* 1 00- A UMTS base station 1 1 0 contains a transmitter/receiver 

with Z^iSSK t n ?° dU ' e 150a ' ^ Emitter/receiver 112 communicates via a switching network 160 
Sf Thi t™„. If t / ransm,tte ^ ce,ver 146 loc ^ed in a remote base station 140 and containing a decoder module 
Si T*l t an^S! CeWer VL** COmmunicates with a ™W handset 1 60a, which contains a decode ZZl 
Horn communicates with another mobile handset 160f, which contains a decoder untt 

25 ?£ 191 Thebas V tation 140 cont ains further transmitter/receivers 142 and 144, containing decoder units 150d and 
viat^^ 

deader ssLTSri n remote cdma base station 130 containin9 cdma 

JSSi ?L t » Emitter/receiver 142 also communicates with a mobile handset 160d, containing decoder 
so nnuVr^ZT, 132 communicates with » mobile handset 160c, containing decoder unft'so" 

contelnlT^ 

1m T 9 ! mc? SWer 122 and deC0der unit 150t> - ^ transmitter/receiver 144 also communicates with a 

[0021] The decoder units 150a, 150b, 150c, 150d, 150e, 150f, 150g 150h 150i 150i 150k and i«;n m Ia^h 
ss er^ 

£2" T Whfch b6en C ° nfi9Ured t0 10 dWerent cellular network standards 

EH? ,« C ° der arch itecture of Fig. 1 offers telecommunication companies operating multiple network 

standards great benefits In flexibility and cost reduction as the same decoder b.ock can be used to SjEiJSE 
different coding schemes in different network components. implement many 

40 [0023] Fig. 2A shows a typical communication system 200 in which coding is used to improve the transmission of 
noTda* s r m V ranSmitte : 21 o° 10 3 rSCeiVer 27 °- The tranSmitter 21 0 has an "formationTuJ TsS^SSS^ 
Za^XT'l^l 1 " redUndam inf0miati0n " 8dded t0 the input dala stream in a -^n a e 
SlmhSn nf f ? 9 S ° 38 10 ' mpr0Ve the abi ' ity t0 detect and wffKt errore which may occur during the 
,nf0 7 at ' 0n as a result * n°i^ sources present in a communication channel 240. The encoded inpS 
fnfZl then t modu,ated 230 t0 im Press the encoded data streamonto a waveform to be transmitted The encoded 
information .s transmitted over a channel 240, which has many sources of noise 280 acting upon it The channel 240 

an^TaT^ 

an output to a decoder 260, which outputs a received information signal 275 9 
[0024] Fig 2B shows the communication system 200, in which the decoder 260 is a generic Viterbi decoder The 
input to the Viterbi decoder 260 is coded information received from the channel 240. The V^^de^cSd^ 

conS,e m rt n n T ° • (BMC) 2 ^ ^ ° UtpUt * preSented to an add-compare-setect Vo^£?E£ 
controller 290 provides .nputs to the BMC unit 289, the ACS unit 291 and a path metric memory 292 The path meTric 

ACS S 8CtS M 3 fT* bUffer and interc nanges information with the ACS unit 291. A bomow output 294 oflhe 
^ ACS unit 291 ,s presented to a traceback memory and controller 293. whose output is the received information signal 

[0025] Fig. 2C shows a Turbo decoding configuration of the decoder 260 of Fig. 2A. A received symbol in a turbo 

SSSSZSiTT 0 T 1 rePreSentin9 th8 a ° tUal data b6ing transmitted ' and pa *V datl wT^re °eS 
thecodedform of the data being transmrtted. A first input 261, being the parity data of the received symbol, is presented 
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to a demultiplexer 263. A first output 264 of the demultiplexer 263 is presented to a first decoder 266. A second input 
262, being the systematic data of the received symbol, is presented to the first decoder 266. A recursive input 277 is 
also presented to the first decoder 266. The output 267 of the first decoder 266 is then presented to an interleaver 268, 
whose output 269 is presented to a second decoder 271 . A second output 265 of the demultiplexer 263 is also presented 
5 to the second decoder 271. A first output 272 of the second decoder 271 is presented to a first deinterleaver 274, 
whose output is the recursive input 277. A second output 273 of the second decoder 271 is presented to a second 
deinterleaver 276. The output of the second deinterleaver 276 is presented to a slicer278, which applies a threshold 
to a soft output to convert it to a hard output, being a received information signal 275. 

[0026] The unified decoder architecture of the preferred embodiment is intended to replace the decoder 260 in wire- 
10 less communication systems having both voice and data capabilities and exploits the similarity in the computations 
needed for Viterbi decoding and LOG-MAP Turbo decoding so that memory and processing units are used efficiently 
when configured for either of such schemes. LogMAP is an algorithm which may be utilised in the decoding of convo- 
lutional codes. LogMAP is also used in one half cycle of a turbo decode iteration. Processors within the preferred 
embodiment are stacked together and interconnected using a hierarchical switching structure so that they can perform 
15 independently as separate decoders or, alternatively, they may be combined to form a single high speed decoder, with 
a predetermined processor being dominant. 

[0027] Fig. 3 shows the block architecture of a unified decoder structure 900 in accordance with an embodiment of 
the present invention. A multi-bit input symbol 901 from an alphabet of a prior agreed coding scheme for a particular 
transmission is broadcast to a bank of butterfly decoding processors 920. The bank of butterfly decoding processors 
20 920 also receives as inputs the outputs of a bank of first stores 940. A control unit 960 provides inputs to each of an 
intermediate decoding result memory 91 0, the bank of butterfly decoding processors 920, the bank of first stores 940 
and a bank of a second stores 950. The control unit 960 issues appropriate control signals via the inputs to implement 
convolutional or turbo coding, as desired. 

[0028] The embodiment depicted in Fig. 3 is that of a single row decoder. When multiple decoder rows are intercon- 
25 nected to form a single decoder, each of the decoder rows in the single decoder is presented with the same mufti-bit 
input symbol 901 . When multiple decoder rows are acting as multiple decoders, each decoder being implemented is 
presented with a separate multi-bit input symbol 901 . 

[0029] The bank of butterfly decoding processors 920 produces first outputs 962, 964, 966 and 968, which are trans- > ' 
mitted via a bus 990 to the bank of second stores 950. Outputs of the bank of second stores 950 are presented as 
30 inputs to the bank of first stores 940. A generic embodiment of the decoder typically uses a bank of first stores 940 . 
and a bank of second stores 950 in a double buffering mode. 

[0030] The bank of butterfly decoding processors 920 produces second outputs 961 , 963, 965 and 967, which are 
intermediate decoding results presented to the control unit 960. 

[0031 ] The bank of butterfly decoding processors 920 and the loop feedback connection via at least one of the stores 
35 form a loop functioning as a trellis processor. 

[0032] The intermediate decoding result memory 910 produces a decoded output 999. The intermediate decoding 
result memory 910 may provide recursive results to the control unit 960 when computing a LogMAP algorithm, as 
described later. 

[0033] Fig. 4 shows the block architecture of a unified decoder 1200 in accordance with a preferred embodiment of 
^o the present invention . A control unit 1 21 0 of the unified decoder 1 200 receives a number of inputs, including rate 1 201 , 

constraint length 1202, convolutional or turbo selector 1203, polynomials 1204, trellis direction 1205, number of itera- 
. tions 1206, block length 1207, clock 1208 and reset 1209. The rate 1201 indicates how much information is used to 

represent a single data bit present in a transmitted block. The constraint length 1202 indicates how many previous 

input symbols are used to encode a presented input information bit and is, thus, also an indicator of the complexity of 
45 the trellis being processed to decode a given input symbol. The polynomials 1 204 are generator polynomial coefficients 

used in the decoding process. The number of iterations 1 206 determines how many loops are executed by the decoder 

1200 when operating in turbo mode. A larger value for the number of iterations 2106 indicates a more accurate decoded 

output 1294 at the cost of increased computational time. 

[0034] The control unit 1210 is interconnected to an Intermediate Decoding Memory and Processor 1240, LogLike- 
50 lihood Processors 1250a and 1250b, a bank of multiplexers 1250c, a Comparator 1247, Butterfly Decoding Processors 
1260, a Reverse Address Processor 1270, Normalisation Subtractors 1 278, a bank of multiplexers 1278a, a Path Metric 
Store 1280, a Forward Address Processor 1290, a LogLikelihood Ratio Processor 1297 and an Input Symbol History 
1298. The Control unit 1210 is able to reconfigure the architecture of the unified decoder 1200 via these connections 
to implement either a convolutional decoder or a turbo decoder, as desired. 
55 [0035] Input symbols 1299 are presented to an Input Symbol History 1298, which functions as a double buffer to 
ensure that a constant data flow is maintained. The Input Symbol History 1298 also receives an Input Symbol History 
Bank Select 1211, an Input Symbol History Address 1219, an Input Symbol History Clock 1223 and an Input Symbol 
History Reset 1225 from the control unit 1210. The Input Symbol History 1298 produces a first output 1291a, which is 
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p£££ ?aSST fly DeC ° din9 Pr0CeSS ° re 126 °' 3 SeC ° nd ° U,PUt 1291b " b presented to ^Likelihood 

p 0361 IH! BUtt o rfly Decodin 9 Processors 1260 also receive as inputs reverse trellis path metrics 1265 from the 
ETSiFS" Pr °T S ° r S 270t eXtnnSiC int0rmati0n 1242 from tha intermediate CSKiSTJ?? £ 

rs, c^n D ^irs;z^^^ — in 

[0040] Fig. 5A shows an exemplary butterfly unit of the butterfly processors 1 260 of Fiq 4 havino two Add c mna „ 
Select units 320 and an intermediary Branch-Metric calculator 330. Each of the mS^SSSS^SSIL 

maZfir T T metriC "° 1 2653 and inpUt Path metric ' 1 1 265b - Tba ""Put SymbonSTa and exit sic in^ 
mation 1 242 are broadcast to each of the Branch Metric Calculators 330 in the bank of butterfly prrcessoS EmtE 

XSSS ST 0 calculator is also presented with a butterf * rate 1 21 a SESSSS^S 2 

miUi h!f h Sta !f !" 8 C °' Umn ° f 3 tre,liS hSS 3 pair 0f branch metrics leadi "9 to it. Each of the individual branch 
22? V T!°' f asSOC,ated with «■ Therefore, when navigating a trellis in a given direction one o^o oossS 

symbols is expected for a state under consideration, depending on the previous states "The BMC 4 «2^SS ! 
measure proximity of the receive^ 

branch metric-0 406, which is presented to a first ACS unit-0 320 and a second ACS lunft V 32 X ffh.^ 

aTs u„!m 320^ a 2?2 7" ^ 402 * PreSS " ted l ° the fifSt ACS Unit "° 32 2nST2SS 
to ronTn 1 t f ' S a ' S0 m b,tS Wide ' A butterf| y mode 1221 fe Presented to each of the ACS units 320 

[0042] Each of the ACS units 320 generates two outputs which, for ACS 0 in Fig. 5A. consist of a first outout 1 255a 

^^SS.? 7 '- Th T firSt 0UtPUt 12553 ' S 3 deCiSi ° n bit WhiCh iS the ValUe 0f »• ^ SZEZSS. 
nctaat ng which of the upper or lower potential path-metrics is selected. A decision bit with a value of 0 ™rr W „I„ 

to the lower potential path metric being selected, whereas conversely a value of 1 «^^^££"SS 
ireuis aiagram. ACS 1 produces corresponding outputs 1255b and 1267b 

[ °^L/ i9 58 Sh ° WS 3 " architectu re of an ACS unit-0 320 of Fig. 5A. Two pairs of inputs 402 and 1265b and *m 

Z Z:r^t^ P r* e f h dd6rS 410 ^ 412 ' Th6 P3ir °' inpUtS branch melnS 

12S5a Th! 1 t ? . , ' er33S Se ° 0nd pair 0f inputs consists of brancb rnetric-0 406 and path metric-0 
InfnH f ♦ elementS ° f 6aCh ° f the input P airs are added in respective adders 410 and 41 2 thT coTre 

spending outputs 411 and 413 of the adders 410 and 412 being presented to a Full Subtracter 41 4 T^e outputs i7l 
and 41 3 are also presented to a first two-to-one multiplexer 420. A borrow output 1 255a of the FuH Subt™ S li VI 
fed to the first multiplexer 420 to compute a maximum MAX of the input ^^Iu^bS^^^^I^ 

«^ZTnaTL ACS Unit 'I 0 ' With 3 V3 ' Ue ° f ° indiC3ting th3t the l0Wef Path metric hi beefchose anTa va ue 
th J rt t UPP l P3th metri ° h3S b3en Se,e0ted - AseCond o^put 415 of the Full Subtracter^ 4 represent no 

SLt of * ^ 3ddSr r6SU,tS 41 1 3nd 41 3 ' is presented to a L °9- sum c °rrection table 440 which adfusls he 

result of the new path metric, when the output of the Full Subtracter 414 is small, to produce a more atcurSe resuS 
in the Log domain for LogMAP decoding. An output 441 of the Log-sum correction table 44 fi pSsenSo a VmT 

IS' T °TL?! ° f ,hS ,irSt mU,tiP ' eXer 420 is presen,ed to the Addar 46 ° to a second Ktto-one nSHZ 
450. A result 461 from the Adder 460 is then presented as a second input to the second multXe^oTcSntro 

whether the Viterb, or LogMAP coding scheme is being implemented. The second multiplexer 450 forms an outout 

a t,Zl rZA 30 ACC " mU,a,e R39iSter 470 3 " d 3 fUrth6r mU,tip,exer 480 - The Accumulate tSSSihX 
a butterfly reset 1215 and produces an output 472 to the multiplexer 480. The multiplexer 480 receives a beta-phase 
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enable 1235 as a select signal that selects the output 451 when inactive and the output 472 from the Accumulate 
register 470 when active. The selected output of the multiplexer 480 is the outp ut path metric 1 267a of the ACS unit 320. 
[0044] The bank of multiplexers 1250c receives a select signal 1258 from the control unit 1210, which is used to 
select either the butterfly path metrics 1266 and 1267 output from the Butterfly Processors 1260 or the path metrics 

5 produced by the LogLikelihood Processor-0 1250a and LogUkelihood Processor- 1 1250b. During a Viterbi calculation, 
the butterfly path metrics 1266 and 1267 are selected. In the first phase of a LogMAP computation, butterfly path 
metrics 1266 and 1 267 are chosen whilst the Butterfly Decoding Processors 1 260 compute gammas and alphas. Con- 
temporaneously, LogUkelihood Processor 1250a calculates dummy betas. At the end of the first phase, the path metrics 
produced by the LogLikelihood Processor-0 1250a are selected by the bank of multiplexers 1250c to be broadcast to 

10 enable the calculation of betas in the second phase of the LogMAP computation. 

[0045] The bank of multiplexers 1250c outputs new path metrics on Lower Path Metric Bus 1295 and Upper Path 
Metric Bus 1296. The buses 1295 and 1296 are connected to LogLikelihood processors 1250a and 1250b, a bank of 
multiplexers 1278a and a Forward Address Processor 1290. 

[0046] The Forward Address Processor 1290 receives a Forward Trellis Select 1232, a Forward Trellis Hold 1234, 
15 a Forward Trellis Transparent Bit 1236 and a Path Metric Input MUX Select 1238 from the control unit 1210, which are 
used to configure the Forward Address Processor 1290 in accordance with whether the unified decoder 1200 is being 
used to navigate a trellis in the forward or reverse direction. 

[0047] The Forward Address Processor 1290 orders the new path. metrics received on buses 1295 and 1296 such 
that an apparently sequential list of path metrics is presented to the butterfly processor 1260 for computation of the 
20 next column of the trellis, when the trellis is being navigated in the forward direction. When a trellis is being navigated 
in the reverse direction, the Forward Address Processor 1290 acts transparently. 

[0048] The Path Metric Store 1280 receives addressing information ADDR0 1228a and ADDR1 1228b, Path Metric 
Reset 1 230 and Path Metric Read/Write Clock 1 231 from the control unit 1 21 0, in addition to forward trellis path metrics 
1285, which are output from the Forward Address Processor 1290. The Path Metric Store 1280 outputs stored path 

25 metrics 1276 to a bank of multiplexers 1278a and to LogLikelihood Processors 1250a and 1250b. 

[0049] The bank of multiplexers 1 278a is used as an interconnect point for multiple decoder row configurations, and 
receives stored path metrics 1276, a control signal 1278b from the control unit 1210, and new path metrics on buses 
1295 and 1296. The bank of multiplexers 1278a allows the initialisation of the beta computation during LogMAP cal- 
culation and produces an output 1277 to Normalisation Subtractors 1278. 

30 [0050] A comparator 1247 receives the butterfly path metrics output on buses 1266 and 1267 from the Butterfly 
Decoding Processors 1260 and determines a maximum new path metric. This maximum new path metric is then com- 
pared with a stored maximum path metric and the greater of the two values is presented as normalising output 1246, 
which is sent to the Normalisation Subtractors 1278 and the Intermediate Decoding Memory and Processor 1240. 
[0051] The Normalisation Subtractors 1278 receive the output 1 277 from the bank of multiplexers 1278a and subtract 

35 the Normalising Output 1 246 to ensure that the path metricsare contained within the dynamic range of the architecture. 
The normalised path metrics 1 275 are output and presented to a Reverse Address Processor 1 270 and LogLikelihood 
. Processors 1250a and 1250b. The Reverse Address Processor 1270 also receives as inputs-LogLikelihood Enable 
1214, LogLikelihood 0 Enable 1203 0 and LogLikelihood 1 Enable 1203 1( Reverse Trellis Select 1222, a Reverse Trellis 
Hold 1 224 and a Reverse Trellis Transparent Bit 1 226 from the control unit 121 0. The inputs from the control unit 121 0 

40 are used to configure the Reverse Address Processor 1270 appropriately, depending on whether the decoder 1200 is 
traversing a trellis in the forward or reverse direction. The output of the Reverse Address Processor 1270 is presented 
as reverse trellis path metrics 1265 to the Butterfly Decoding Processors 1260. 

[0052] The Reverse Address Processor 1270 orders the normalised path metrics such that a desired sequence of 
path metrics is presented to the butterfly processor 1260 for computation of the next column of the trellis, when the 
45 trellis is being navigated in the reverse direction. When the trellis is being navigated in the forward direction, the Reverse 
Address Processor 1270 acts transparently. 

[0053] The LogLikelihood Processor 1 250a receives a LogLikelihood Mode 1 21 4a, reverse trellis hold 1 224a, reverse 
trellis transparent bit 1226a, a LogLikelihood rate 1248a, a LogLikelihood constraint 1249a, a LogLikelihood clock 
1 251 a, a LogLikelihood reset 1 252a, LogLikelihood polynomials 1 253a, LogLikelihood 0 Enable 1 203aQ, LogLikelihood 

so Enable 1203a 1( reverse trellis select 1222a, and select signal 1258a from the control unit 1210. The LogLikelihood 
Processor 1250a also receives as inputs the normalised path metrics 1275, the output 1291b from the Input Symbol 
History 1 298, stored path metrics 1 276, new path metrics on buses 1 296 and 1 295 and interleaver extrinsic information 
1256. The LogLikelihood processor 1 250a produces a first output 1245a, which is presented to a LogLikelihood Ratio 
Processor 1297. The LogLikelihood processor 1250a also presents inputs 1266* and 1267' to the bank of multiplexers 

55 1250c. 

[0054] A second LogLikelihood processor 1250b receives corresponding inputs 1214b, 1224b, 1226b, 1248b, 1249b, 
1251b, 1252b, 1253b, 1203b 0 , 1203b 1( 1222b and 1258b from the control unit 1210. The LogLikelihood Processor 
1250b also receives as inputs the normalised path metrics 1275, stored path metrics 1276, interleaver extrinsic infor- 
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%^l£L m V*?- nm ' Path metriCS °" bUSSS 1296 30(1 1295 ' The LogLikelihood processor 1250b produces an 
output 1 245b, which is presented to the LogLikelihood Ratio Processor 1 297 

K2?i„ HIT L ° 9L * el ! h00d P^ssor 1250a is used to compute dummy betas in the first phase of a LogMAP calcu- 
lation. In the second phase of the LogMAP calculation, LogLikelihood Processors 1250a and 1250b are used in co^- 
juncjon wfl, the Butterfly Decoding Processors 1260 to create a LogLike.ihood result for a "1" and £ re^ectivTy 
[0056] The mtermediate Decoding Memory and Processor 1 240 acts as a bufferfor producing output during a Viterbi 
ten^ore for T ' L ^. co ^^ »• Intermediate Decoding Memory and" Processor 1240 acts as a ex- 
ended s ore for the path metnc store 1280. The Intermediate Decoding Memory and Processor 1240 receives an 

ir^^^£ 12, an "ts? e Decoding Direction i23? - a sp ^ eading ,nput 1243 > «r c o a ck 

. 9 S ' 9 " 1254 fr ° m th9 COntr °' Unit 121 a The Intermediate Decoding Memory and 
~ 240 a *>™ eives the Normalising Output 1 246 and Decision Bits 1 255. The Intermediate Decoding Mem 

RaTproc^ 

S2^T„n T a T M6,VeS 30 ' npUt 1293 tr0m the LogLikelihood Ratio Processor 1297. The Intermediate 

sorTlS ISJXS. " Pr ° dUCeS """^ inf0rmati ° n 1256 10 LogLikelihood Procel 

!So Jfh™V k f lih t°f,?n ati0 P [! CeSSOr 1297 reC6iVeS 3 Hard ° r S0ft ° ut P" t Se,ect 1213 and Spreading Input 
an?i250b HI h » add o 0 " 10 0UtpUtS 1 2453 Snd 1245b fr0m 1,16 LogLikelihood Processors 1 250a 

^11, ^ LogL.kel.hood Ratio Processor 1297 also receives as inputs the extrinsic Information 1242 of the 
Intermediate Decoding Memory and Processor 1 240 and Scramble Address Data 1286. The LogLikelihood Ratio Proc- 

P^essor 124o" Pr ° dUCeS ' ^ " 1293 t0 the '"«■""—» Decoding Memo^ and 

5?lLlS£££ ^ ^ 1245b rSpreSent ^probability of the decoded output being a "1 - ora"0», respectively. 
The LogL.ke .hood Ratio Processor 1 297 performs a subtraction of the outputs 1 245a and 1 245b in the log domain 

Decoded Output 1294. The LogLikelihood Ratio Processor 1297 also subtracts the outputs 1245a and 1245b and the 
extnns.c information 1242 to produce the output 1293. which represents new extrinsic formation 
[0059] A code of maximum constraint length k produces a trellis diagram with 2™ states. Fig 6A shows a 32-state 
at Am' ? TZ" 1 °°° , ?°r SPOndin9 10 8 COdS naVi " 9 a maXimum ™ M te "9* * 6. Each of the 32 states 1 Jo2 

S Sm! J L T P T 8 branCh mStriCS mappin9 10 ° ne ° f 32 states 1004 attime St + v F °r e^P'e. state 0 1003 
at time S t has branch metncs 1 006 and 1 008 leading to state 0 1 009 and state 1 6 1 007 at time S, 

oMh«L^li 2 ," St f raW K tre " iS diaQram 1000 may be re P resentsd fa V 16 corresponding butterf7y + connections 1010 
of the same frell.s. I can be seen that pairs of states in one column 1012 of the trellis map to corresponding pairs of 

□2 mLT ^ Th • , aS I Sh0 T n in R9 4 ' f ° Ur bUtterf ' y P rocess ° r « 1260 are provided. This allows 8 resultant 
path metnc locations to be calculated in each clock cycle. 

S,^t 9 ' 6B h Sh T the ras " ltant P^h metric locations 1014 for a 32-state trellis diagram. The 32 resultant path 
path metric locations produced by four butterfly processors 

S^Rat 7a" \T?c° 7 n T^T* T^' SUM ' S 0 P erati " ns - each of w "ich corresponds to a single clock 
Xaca ; Iaa I ^° W u he pr ° CeSS by Whi0h 3 preferred embodiment of the invention implements 

ZTn^lT ^ ff reSS ' ng - F '9 J A shows time 1=1 • corresponding to the first sub-trellis operation, in which eight 
new path metncs 1112 are presented as inputs. New path metrics 1112 representing 0, 1, 2 and 3 are written into a 
f irst column of memory 11 02, corresponding to upper memory blocks B0 of Path Metric Store 1 280. whilst path metrics 
SoIkI 19 arawr,tten into four holding registers 1114. Path metrics 16, 17, 18 and 19 are held for a clock cycle 

tht n!w 9 . ?" 1 me T 0ry ' 33 thS memPry '° cati0ns t0 which the y wiM be wri «en will not become available until 
the next clock cycle when the new path metrics for trellis states 8 to 1 5 have been calculated 

5ULh T ne fo C J 0Ck 0X0,6 1=2, Sh ° Wn in Fig - 7B ' 3 furtner eignt new P atn metrics 1122 ara Presented as inputs. 
The path metncs 1122 corresponding to new path metric locations 4, 5, 6 and 7 are written into a first column of memory 
1 04. corresponding to lower memory blocks B1 of Path Metric Store 1280. The contents of the holding registers 1114 
are written into a second column of memory 1102, corresponding to Path Metric Store 1280 B0 and the new oath 

regrsters C ?iU P ° ndin9 t0 P3th metri ° '° Cati0nS 2 °' 21 ' 22 a " d 23 arS Written in 33 the new con,ents of » e "olding 
[0064] In the third clock cycle shown in Fig. 7C, a further group of new path metrics 1 1 34 is presented. The new path 
memcs corresponding to states 8. 9. 10 and 11 are written into a third column of memory 1102, corresponding to Path 
Metnc Store 1280 B0 and the contents of the holding registers 1114, being states 20, 21, 22 and 23. are written into a 

fnTtri^T/l 3 TT! 04 ' COrres P° ndin 9 t0 Patn Metric Store 1280 B1 . The fournewpath metrics correspond- 
ing to states 24, 25, 26 and 27 are written into the holding registers 1114. 
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[0065] Fig. 7D shows the fourth clock cycle, during which the final eight new path metrics 1144 are presented. The 
new path metrics corresponding to states 12, 13, 1 4 and 1 5 are written to a third column of memory 1104, corresponding 
to Path Metric Store 1 280 B1 , the contents of the holding register corresponding to states 24, 25, 26 and 27 are written 
to a fourth column of memory 1102, corresponding to Path Metric Store 1280 B0, and the new path metrics corre- 

5 sponding to states 28, 29, 30 and 31 are written to holding registers 1114. 

[0066] An additional clock cycle corresponding to t=5, as shown in Fig. 7E, is required to write the contents of the 
holding registers 1114 into a fourth column of memory 1104, corresponding to Path Metric Store 1280 B1 . 
[0067] Fig. 7F shows a representation of the addressing of the path metric columns for a 32-state trellis in accordance 
with a preferred embodiment of the present invention. The addressing sequence of the path metric columns 1150 

10 corresponds to the read/write addresses of the path metric columns. Each row of the table 1160 corresponds to a 
different column of a trellis diagram, representing Symbol time n (S n ), Symbol time n+1 (S n+1 ) and Symbol time n+2 
(S n+2 ). It is evident that the movement of the addresses of the path metric columns is periodic. 
[0068] Figs. 7A-E have shown the progression from S n to S n+1 . The next clock cycle, t=6, will begin the transition 
from S n+1 to S n+2 and columns 0,2, 1,3 will be executed in order to present a sequential list of states to the ACS units. 

15 [0069] Figs. 8A, 8B, 8C, 8D, 8E and 8F show the process by which a preferred embodiment of the invention imple- 
ments in-piace path metric addressing during navigation of a reverse trellis. Fig. 8A depicts the notation that will be 
followed in Figs. 8B-F. Fig. 8B shows time t=1 , corresponding to the first sub-trellis operation. The path metrics resident 
in the first column of memory A, C 0A , have been shifted to a hold register 3010. In Fig. 8C, time t=2, the first column 
of memory B, C 0B , is moved to the hold register 301 0 and a function of C OA and form resultant path metrics C OA ' 

20 and C 0B ., which are written into the first column of memories A and B respectively. In Fig. BD, the second column of 
memory A C 1A is deposited in the hold register 3010. A function of the previous contents of the hold register C 0B and 
C 2B form new path metrics C 1A * and C 1B ' which are written back into the third column of A and B respectively. 
[0070] Fig. BE shows time t=4 in which C 1B is written to the hold register 3010, A function of C 1A and C 3A produces 
new path metrics and C 2B ', which are written into the second columns of memories A and B respectively. Fig..8F 

25 shows the reverse sub-trellis operation corresponding to time t=5 in which a function of C 1B and C 3B forms resultant 
path metrics C 3A and C ZB K which are written into fourth columns of memories A and B respectively. In the calculations 
of the reverse trellis, path metrics are presented to four butterfly processors in a scrambled manner and the in-place 
path metric addressing described in Figs 8B to 8F ensures that the resultant path metrics are presented in a sequential 
manner. 

30 [0071] Fig. 9A shows a high level schematic block diagram representation of an embodiment of the Intermediate 
Decoding Memory and Processor 1240, which performs traceback and interleaver functions in respective decoding 
schemes. The Intermediate Decoding Memory and Processor 1240 receives as inputs decision bits 1255, normalising 
output 1 246, spreading input 1243, intermediate decoding direction 1237, intermediate decoding mode 1212, a clocking 
signal 1254, a read/write clock 1257, a reset signal 1259 and the output 1293 from the LogLikelihood processor 1297. 

35 The Intermediate Decoding Memory and Processor 1240 produces outputs including extrinsic information 1 242, inter- 
leaver extrinsic information 1256 and Traceback processor output 1567. 

[0072] -Fig. 9B shows an exploded view of the Intermediate Decoding Memory and Processor 1240. A Traceback- 
Address Controller 1510 receives as inputs Decision Bits 1255, the Intermediate Decoding Direction 1237, Normalising 
Output 1246, the clocking signal 1254, reset signal 1259, read/write clock 1257 and the Intermediate Decoding Mode 

40 1212, which is inverted. The Traceback Address Controller 151 0 produces an output 1567. 

[0073] The Traceback Address Controller 1510 writes Decision Bits 1255 to a Window Memory Subsystem 1520 
every clock cycle. During traceback, the Traceback Address Controller 1510 examines a trellis section to determine a 
biggest value to be used as a starting point. It is to be noted that it is not necessary to store the complete value for 
each state as a new traceback byte address can be generated using one of the Decision Bits 1255. 

45 [0074] An Interleaver Controller 1 520 also receives a clocking signal 1 254, reset signal 1 259, read/write clock 1 257 
and Intermediate Decoding Mode 1212. In addition, the Interleaver Controller 1520 receives the output 1293 from the 
LogLikelihood Ratio Processor 1297, the Spreading Input 1243 and the Intermediate Decoding Direction 1237. The 
Interleaver Controller 1 520 produces extrinsic information 1 242 and 1 256. The extrinsic data 1 242 is used as a recursive 
input to the Butterfly Processors 1260 when the decoder 1200 operates as a Turbo decoder. 

so [0075] The Interleaver Controller 1520 produces extrinsic information 1242 and 1256 at the beginning of every clock 
cycle. At the end of every clock cycle, the Interleaver Controller 1 520 receives new extrinsic information in the form of 
the output 1293 from the LogLikelihood Ratio Processor 1297 and writes it into memory. 

[0076] The Traceback Address Controller 1510 and Interleaver Controller 1 520 are interconnected and supply a joint 
read/write signal 1515 to a Window Memory Subsystem 1530. The Traceback Address Controller 1510, Interleaver 
55 Controller 1520 and Window Memory Subsystem 1530 are further interconnected by a bi-directional data bus 1526 
and an address bus 1525. The Interleaver Controller 1520 has a second address bus 1535 connected to the Window 
Memory Subsystem 1530 and the Window Memory Subsystem 1530 produces an output on a second data bus 1536 
to the I nterleaver Controller 1 520 . 
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Fig ^ C f s ^ s the Traceback Processor 1510. The decision bits 1255 are presented to a first multiplexer 
S'JS^ ^e mulfplexer 1 550 is presented to a decisions register 1 555. The output of the decisions registe 

inn^ IS , f ' TT ,Spresented 38 an 0Ut P ut of *• Traceback Processor 1510 and is also fed back as a recursive 
input of the first multiplexer 1 550 and as an input to a bit select 1 558 recursive 

EL t«n e !2! ernle 1 a i t L D n eC , 0din9 DireCti ° n 1237 fe Panted ^ an input to an address translation unit 1 560. The 
address translation unrt 1560 also receives a read/write clock 1257 and produces an output address 1525 and a reatf 
wnte s,gnal 1515. The read/wnte clock 1257 is also presented as the select of the first multiplexer 1550 

562 L D tTJ^T m9 ° UtPU ! ! 24 ! fe PreSemed aS a " inpUt 10 a state re 9 ister 1562 - The °*put of the state register 
12 tZL Z .f TVL? 8 9ddreSS tranS ' ati0n Unit 1 56 °- as we " as bei "9 an in P«t to a previous state unit 
PnSJor SSJIST " PreSentS "° inPUtS 10 3 Se ° 0nd mUltipleXer 1566 - Wh0Se 0U,put is ^e Traceback 
££7«J? ° UtpUt °l the bitse,ect 1558 is P-sented as an input to a first AND gate 1568. The output of the AND 

f^onH AMn Pr ? e r« Q aS K " ' nPUt 10 thS State rS9iSter 1562 ^ output of the bit select 1558 is also presented to a 
second AND gate 1 569, whose output is also presented to the state register 1 562 

[0081] The Intermediate Decoding Direction 1237 is presented as the second input to the first AND gate 1568 and 
^TaT^T^™ 1S66 ' dSCOde "* ° UtpUt 1502 iS a,S ° P ' eSented - via a NOT 9 g a aVSo a to 
l^nLn 5 !° f «r th . S interleavercont rollerof 1 520 of Fig. 9B. The Intermediate decoding mode 1 21 2 is presented 

S hIm,!, T ^ We Spreadln 9 ,n P ut 1243 - tri-state buffer 1582 also receives as a input a read/ 

7 ? ?7 I f ^ S6C0 ^ d tn " State bUffSr 1583 reCeiVSS the output 1293 from ^e LogLikelihood ratio processor 
1297 as Its second.nput. The output 1293 from the LogLikelihood Ratio Processor 1297 is also presented as ZZ 

LTtoZ ° '? ^ c T d 1 ?u Spreadi " 9 inpUt 1 243 iS P resented ,0 ««* of *e logic blocks 1 584 and 158? 
as is me reset signal 1259. and the clock signal 1254. The Interleave 1 520 receives data bus 1526 as an input and 
presents a correspondmg output being extrinsic information 1242. A second data bus 1536 is output as interleave 

1526 fs C th n e oZT^ T ^T^ 1526 iS bi - directiona| . and ^e output of the .nterieaver iS ~*?2ZZ 
1 526 is the output of the tri-state buffer 1 583. 

KLT^ 7 " rSt J° 9iC , bl0Ck 1584 rSCeiV6S the '"Mediate decoding mode 1212 and the intermediate decoding 
Sf" prod " ces an address 1 525. The second logic block 1586 also receives the intermediate decoding 
ST.. '"termediate decoding direction 1237 and produces address 1535. Each of the logic blocks 1584 and 

1 586 also receive an input beta_d, which is a low or high power signal 

E'lf Sh ° WS ^ eXP i° ded ViSW ° f the ,09iC bl0Ck 1584 of Fi 9 ure 9D - "^e logic block 1 586 of Figure 9D has 
the same configuration. A wmdow count 1590 receives as inputs a reset 1259, a clock 1254 and an enable 1212 It 

toa° d ^ 

* • J he flrSt adder 1592 receives a s a second input the constant 1599 and presents its output 

to the window count 1590. The bit count 1 591 recedes as inputs the reset 1259. the Cock 1254, the enable 1I12 Td 

1594 K ' addar 1594 ' The bit count 1591 P'°°uces an output which is presented to two adders 1593and 
I ~ SU T£- B J Pre ! ented 0 an e ' emen t 1595, which adds 1 and if beta.d is active, it negates the value and presents 

^bLunt ?Sl 3dder 1594 ' ° UtPUt ° f the 3dder 1594 iS prBSented as a cursive input to 

C, T | ,S 0utput ° f the second adder 1593 is presented as an input to a multiplexer 1596 and to a scramble 1597 
The multiplexer 1596 recerves a select signal indicating if the architecture is operating as a first or second decoder 

™f f^h? 9 ° UtPUt ° f thS SCramb ' e 1597 - ne output of * e multiplexer 1596 is the address 1525 

The scramble 1 597 deceives the Spreading Input 1 243 as an enabling signal and the output 1 293 from the LogLikelihood 
Ra bo Processor 1297 as date. The scramble 1597 could be memory or logic function as is well known in the art and 
caSatJonr scrambl,n 9 of addresses between a first and second decoder when undertaking Turbo decoder 

[0086] Fig. 9F shows a schematic block diagram representation of the window memory sub system 1530 of Fia 9A 
A read/wnte clock 1515. address buses 1525 and 1535, and data buses 1526 and 1536 are presented* a window 
address decoder 1530a and window memories 1530b...1530d. 

LnH? F ' g J 0A ^°Z S Lo 9 Lik e |ih °°° Processor 1250a of Fig. 4. A bank of four butterfly units 1410 is provided 
fill Thh " o S UnitS 1412a " 1412h are Presented with pairs of reverse trellis path metrics 1415a.. h from a 
Reverse Address Processor 1270b and stored path metrics 1276 from the Path Metric Store 1280. The stored path 

T^l , k n re f T e o?J PhaS in thS L ° 9MAP ca,culation - Each of the ACS units 1412a...h is also presented with a 
LogLikehhood Mode 1214a, a LogLikelihood Clock 1251 a and a LogLikelihood Reset 1252a. BMC units 1414a 1414d 

a 249 a aC th!T ?t ! n oT ber ° f iRPUtS ' indUdin9 the L °S Lik e'ihood Rate 1248a, the LogLikelihood Constraint 
?*?h £. ^Likelihood Polynom«l8 1253a. interieaver extrinsic information 1256 and the Input Symbol History 
1291b. The ACS un,ts 1412a...h produce first outputs 1413a...1413h. which are presented in sequential pairs to the 
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ACS node units 1420a...1420d. The ACS units 1412a...h produce second outputs 480a.. .h, each of which is presented 
to a corresponding normalising subtractor 1470a... 1470 h. The normalising subtracters 1470a.. .1470h produce outputs 
1266' and 1267', which are fed recursively via multiplexers, as explained below, to the Reverse Address Processor 
1270b and used to ensure that the path metrics remain within the dynamic range of the architecture. 
[0068] Each of a first bank of multiplexers 141 7a...h receives a corresponding normalised path metric 1275a...h from 
the Normalising Processor 1 278 and a select signal 1 258 from the control unit 1210. Multiplexers 1 41 7a.. .d also receive 
corresponding path metrics 1296a. ..d, and multiplexers 1417e...h receive corresponding path metrics 1295a...d. The 
path metrics 1295a...d and 1296a.. .d represent betas in the LogMAP calculation. The select signal 1258 is used to 
determine whether the normalised path metrics 1275a...h or the path metrics 1295a...d and 1296a.. .d will be output. 
[0089] Each of a second bank of multiplexers 141 6a.. .h receives LogLikelihood Mode 1214a as a select signal and 
a corresponding output from the first bank of multiplexers 1 41 7a. ..h. Multiplexers 1 41 6a.. .d receive a third input, being 
the output 1266' of the normalising subtracters 1470a..d and multiplexers 1416e..h receive the output 1267* from the 
normalising subtractors 1470e...h. The outputs from the multiplexers 1416a..h are presented as inputs to the Reverse 
Address Processor 1270b. 

[0090] The Reverse Address Processor 1270b also receives a LogLikelihood Mode 1214a, Turbo enable for Log- 
Likelihood 0 Enable 1203a 0 , Turbo enable for LogLikelihood 1 1203a 1t reverse trellis selector 1222a, reverse trellis 
transparent bit 1226a and the reverse trellis hold 1 224a. The beta outputs 1266' and 1267* of the LogLikelihood Proc- 
essor 1 250a represent the final dummy beta values used for the start of the beta processing phase, when the decoder 
1200 is operating in LogMAP/turbo mode. 

[0091] The outputs of the ACS node units 1420a and 1420b are presented to an ACS node unit 1430a and the 
outputs of the ACS node units 1420c and 1420d are presented to an ACS node unit 1430b. The outputs of the ACS 
node units 1430a, 1430b are presented as inputs to a further ACS node unit 1440a, whose output is presented to a 
multi-row comparator tree, which spans the decoder when operated in a multi-row configuration so as to capture the 
maximum path metric being calculated for the state of the trellis being investigated. An output from the multi-row com- 
parator tree is presented to a subtractor 1 450 and a register 1 460. The subtractor 1 450 also presents a recursive input 
to the register 1460. The register output 1245a is fed to the subtractor 1450 and to each one of the normalising sub- 
tractors 1470a...1470h, in addition to being an output of the LogLikelihood Processor 1250a. 

[0092] Fig. 10B shows one arrangement of the ACS node unit 1420a of Fig. 10A. The outputs 1413a and 1413b 
from the ACS leaf units are presented as inputs to a comparator 1474 and a multiplexer 1476. A borrow output of the 
comparator 1474 is fed as a select signal of the multiplexer 1476. A difference output of the comparator 1474 is pre- 
sented as an input to a log sum correction table 1478. The output of the log sum correction table 1478 is presented to 
an adder 1480, whose second input is the output of the multiplexer 1476. The adder 1480 computes and outputs the 
sum 1425a of its two inputs, the sum 1425a representing the maximum of the two inputs 1413a and 1413b, with a log- 
sum correction. 

[0093] Fig. 10C shows a configuration of a LogLikelihood processor 1250a of Fig. 4 for an eight row decoder em- 
bodiment. The LogLikelihood processors 1250a' in each of the rows are interconnected via a bank of multiplexers 
1490. Each multiplexer 1490 presents a single input to a LogLikelihood processor 1250a' in its corresponding decoder 
row. Pairs of LogLikelihood processors 1250a' present their outputs as inputs to ACS node units 1420a 1 , 1420b', 1420c' 
and 1420d'. The outputs of the LogLikelihood processors 1250a* are also presented as recursive inputs to the bank of 
multiplexers 1490. The ACS nodes units 1420a', 1420b', 1420c' and 1420d* are paired and present their outputs as 
inputs to further ACS nodes units 1430a' and 1430b'. The outputs of the ACS node units 1420a', 1420b', 1420c' and 
1420d' are also presented as recursive inputs to the bank of multiplexers 1490. The ACS node units 1430a' present 
their outputs to a final ACS node unit 1 440' and as recursive inputs to the bank of multiplexers 1 490. The output of the 
final ACS node unit 1440' is presented as a final recursive input to the bank of multiplexers 1490. Each multiplexer 
1490 is presented with a select signal. 

[0094] Fig. 10D shows one useful architecture of an ACS unit 1412a of Fig. 1 0A. A first pair of inputs, branch metric 
1 402' and branch metric 0 406', are presented to a multiplexer 408, which produces an output 402". A second pair of 
inputs, path metric 1276a and path metric 1 1415b are presented to a multiplexer 409, which produces an output 403'. 
Each of the multiplexers 408 and 409 receives LogLikelihood Mode 1214a as a select signal. When the LogLikelihood 
Mode 1214a is inactive, branch metric 1 402* is selected by multiplexer 408 and path metric 1 1415b Is selected by 
multiplexer 409. Conversely, when LogLikelihood Mode 1214a is active, branch metric 0 406' is selected by multiplexer 
408 and path metric 1276a, representing an alpha value, is selected by 409. 

[0095] The outputs 402" and 403' of the multiplexers 408 and 409 are presented to an adder 410'. The sum 411' is 
output from the adder 41 0' and presented to a multiplexer 41 6' and a multiplexer 41 7'. The multiplexer 41 7' receives 
branch metric 0 406' as a second input and LogLikelihood Mode 1214a as a select signal. The output 418' of the 
multiplexer 41 T is presented to an adder 412'. The adder 412' receives path metric 0 1415a as a second input. The 
adder 412' produces a sum 41 3', which represents the sum of alphas, betas and gammas. The sum 413' is presented 
to a Full Subtractor 41 4' and a multiplexer 420'. The multiplexer 41 6'receives a hardwired input 407* corresponding to 
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fun SZlf^ l6mem number 31)1610 be represented and a LogUkelihood Mode 1214a as a select signal The 
and ISSSSS^" reC6WeS ,h60Ut P Ut408 ' ° f the mu ft ip.exer4 16 - as a second input and produoJfKlJE 

Ses ^^^S^TS^*' 6 " iS PreS6nted 38 3 firet inDut t0 a ™ ,ti P^ The multiplexer 420" 
theS^ 

unfts BMCO BMC3 9 265h prssented 10 each ° f ACS units ACS6 and ACS7. The BMC 

nSm^SJSiSE h o 9S ' nPUtS eXtrinSiC inf0 ™ ati ° n 1242 and input symbol history input symbol 1291a 
Sbu™e S ^ 



as four butterfly processors: 

(QACSO, BMCO, ACS1; 

(ii) ACS2, BMC1 , ACS3; 

(iii) ACS4, BMC2, ACS5; and . 

(iv) ACS6, BMC3, ACS7. 



[0101] Fig. 1 2 shows the Reverse Address Processor 1270 of Fiq 4 The Reverse Addm<» Pmr« eA r i ota 

^^na^;^T ,h t 00a ? Enabl6 1203 °' 8 Lo 9 Likelih °° d 1 Enable 1203,. andthe normalised pat 

ln?i2?5g Z iSid anS S r *5 ^ ^ PairS ° f inpUtS 12753 and 1275e " 1275b ™* "SuSS 

ana iz75g, and 1275d and 1 275h to a first bank of corresponding multiplexers 1 91 0a3 1 91 0b3 1 Q1 0r9 an rt 1Q , 

and asecond bank of corresponding multiplexers 1915a 19l5d ' 1910c3aneM91 °<»3. 

t0 l 02, -oo? e r6VerSe tre " iS Sele ° t0r 1222 is P resented '0 each of the first bank of XOR gates 1920a d The XOR 
Sl^ 

second bank c XOR o^ 9** 1920a d produces an output which is presented to a corresponding XOR gate in a 
second bank of XOR gates 1 925a...d and to a corresponding one of the multiplexers 1 91 0a3 1 91 0d3 fJ* n f 11 
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1915a...d receives a pair of normalised path metrics The outputs from the XOR gates 1925a...d act as select signals 
for the respective multiplexers 1915a...d to choose one of the presented normalised path metrics. Each of the multi- 
plexers 1915a...d presents an output to a corresponding one of multiplexers 1910b1, 1910d1, 1 91 Ofl and 1910M. 
[0103] Multiplexers 1 910a3,..d3 an 1 91 5a.. .d are presented with different pairs of inputs depending on the values of 

5 the Loglikelihood Enable 1214 and the LogLikelihood Enable 0 1203 0 and LogLikelihood 1 Enable 1203 v LogLikeli- 
hood 0 Enable 1 203 0 is enabled for LogLikelihood Processor 0 and disabled for LogLikelihood Processor 1 . Conversely, 
LogLikelihood 1 Enable 1203 1 is enabled for LogLikelihood Processor 1 and disabled for LogLikelihood Processor 0. 
As the Reverse Address Processor 1270 is used in several locations within the unified decoder 1200, the Reverse 
Address Processor 1270 must be capable of handling different modes of operation. When the LogLikelihood Enable 

10 1214 and LogLikelihood Enables 1203 0 and 1203 t are inactive, the Reverse Address Processor 1270 is in Viterbi 
mode operating on a treilis generated by a non -systematic convolutional code. When LogLikelihood Enable 1214 is 
active, the Reverse Address Processor 1270 is performing reverse trellis switching forthe LogLikelihood operation for 
LogMAP decoding. When either of the LogLikelihood Enables 1 203q and 1 203 t is active with the LogLikelihood Enable 
1214 active, the Reverse Address Processor 1270 is performing switching appropriate for a LogLikelihood operation 

is using a recursive systematic code, as in Turbo decoding. The XOR gates implement the appropriate switching for the 
different operating modes of the Reverse Address Processor 1 270. 

[0104] Each of the first bank of multiplexers 1910a3...1910d3 produces an output which is presented to a corre- 
sponding latch 1910a2...1910d2. Each of the latches 1910a2...1910d2 receives the reverse trellis hold 1224 as an 
input and presents a delayed output as the second input to a corresponding one ofthe multiplexers 1 91 0a1 , 1 91 0c1 , 
20 1910e1 and 1910g1. 

[0105] The reverse trellis transparent bit 1226 Is broadcast to each of a third bank of multiplexers 1910a1 ...1910h1, 
which produce corresponding path metrics 1265a.. .h. The path metrics 1265a.. .1265h are collated and presented as 
reverse trellis path metrics 1265, the output of the Reverse Address Processor 1270. When the decoder 1200 is op- 
erating in the forward trellis direction, the reverse trellis transparent bit 1226 is set such that the Reverse Address 
25 Processor 1270 allows the normalised path metrics 1275 to pass through to become the reverse trellis path metrics 
1265, without alteration. 

[0106] Fig. 13 shows Normalisation Sub-tractors 1278 of Fig. 4. The normalising output 1246 is presented as an input 
to each of the sub-tractors 1610a... 161 Oh. The output 1277 of the bank of multiplexers 1278a is presented as individual 
path metrics 1277a...1277h, each of which is presented to corresponding subtractors 1610a.. .1610h. The outputs 
30 1275a.. .h of the subtractors 1610a...1610h form the normalised path metrics 1275. The normalising subtractors 1278 
are used to subtract the maximum path metric, calculated during the traversal of the trellis and presented as the nor- 
malising output 1 246, from the new path metrics to ensure that the path metric values are retained within the dynamic 
range of the architecture. 

[01 07] Fig. 1 4 shows a comparator 1 247 of Fig. 4, in accordance with a preferred embodiment of the invention. The 

35 butterfly path metrics presented on bus 1267 are fanned out to produce inputs 1267a...1267d to corresponding max- 
imum comparators 1 71 0a. . . 1 71 Od. Similarly, the butterfly path metrics presented on bus 1 266 are fanned out to produce 
, inputs 1266a.. .1266d to corresponding maximum comparators 1710e...1710h. The path metrics 1266a... 1266d and 
1267a...1267d are compared against one another and a maximum path metric 1 715 is output to a multi-row comparator 
tree, shown in Fig. 17, which spans the decoder when operated in a multi-row configuration so as to capture the 

40 maximum path metric being calculated for the state of the trellis being investigated. An output 1716 from the multi-row 
comparator tree is presented to a register 1720, which stores the greatest path metric calculated during the traversal 
of the trellis. The output 1 71 6 is also presented as an input to a subtractor 1730. The register 1 720 provides a second 
input to the subtractor 1730, the input being the greatest path metric calculated thus far during the traversal of the 
trellis. The subtractor compares the greatest path metric calculated during the traversal of the trellis with the maximum 

45 path metric 1 715 and if the maximum path metric 1 715, which has just been calculated, is greater than the greatest 
path metric calculated during the traversal of the trellis, a load signal 1 735 is enabled to the register 1 720 so that the 
maximum path metric 1715 is loaded into the register 1720 to become the greatest path metric calculated during the 
traversal of the trellis. The register provides a further output, being a normalising output 1246, which is fed to the 
normalising subtractors 1278 and to the Intermediate Decoding Memory and Processor 1240. The normalising output 

50 1246 is used to ensure that calculated path metric values remain within the dynamic range of the architecture. 

[0108] Fig. 15 shows a path metric memory 1280 of Fig. 4, in accordance with a preferred embodiment. A path metric 
reset 1230 and path metric read/write clock 1231 are presented to each ofthe memory units 1 81 0a...1 81 Oh. The upper 
memory blocks 1 81 0a...1 81 Od are clustered as B0, and receive an input ADDR0 1 228a. Conversely, the lower memory 
blocks 1810e...1810h are clustered to form B1, and receive a corresponding input ADDR1 1228b. The path metric 

55 store 1280 receives the forward trellis path metrics 1285, which fan out and provide a path metric 1285a,.. 1285h to 
each of corresponding memory blocks 1810a...1B10h, as shown In the diagram. The path metric store 1280 buffers 
the forward trellis path metrics 1285 for one trellis processing cycle and then produces outputs 1276a...1276h, which 
are aggregated and form the stored path metrics 1276. 
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E™ 9; . "^f 1° AddreSS Processor 1290 <* R 9- 4 - i" accordance with a preferred embodiment of 
he present .nvention. The Forward Address Processor 1 290 provides facilities for delaying and ordering path metrics 
to produce a desired pattern of path metrics. The Forward Address Processor 1290 is also capable of acting transpar- 
ent* when the decoder 1 200 is operating in reverse trellis mode such that input path metrics are presented as oJ&L 
wrthout alteration. The upper path metric bus 1 296 is broken into its component path metrics 1 296a...1 296d which are 
presented, as indicated, to two muttiplexers 201 0a and 201 0b, each of the multiplexers receiving two input path metrics 
The lower path metric bus 1 295 is broken into its constituent path metrics 1 295a...1 295d and presented, as indicated' 

ioTcl m Z e r h 2010C 201 ° d ' eaCH ° f thS mUltiPteXerS reCeM "9 in P ut P ath ™* multiSS 

1296d an2 !2Z? Y^T » tre " iS ^ M ' M * S WhiCh ° f the P resented P at " «™trics 1296a... 

l^yba ana 1 295a... 1 295 d is to be selected. 

[0110] Each of the multiplexers 2010a...2010d feeds into a corresponding hold register 2015a. .2015d The hold 
I^ff 2 ° 1 „ 5a K - 2 ° 1 5d6aoh receive an in P ut - bei "9 forward trellis hold 1234. The purpose of the multiplexers 201 0a 
201 Od and the hold registers 201 5a. ..201 5d is to delay certain of the path metrics 1 296a...1 296d and 1 295a 1 295d 
by a clock cycle as part of the in-place path metric addressing 

So 1 L ln E d^ th Tl^l^^ iSt T. 20 ! 5a u-' 2015d Pr ° dUCeS m ° UtpUt Which is P resented to a bank of multiplexers 
SSJ Z ?u ^ mpU ! S t0 thS bank ° f multi P |exers 2 °20 are the constituent path metrics of the upper path 

Z V. :t J ♦ J P metriC 129S ' alS ° 38 Sh0Wn - A path metrio murtip.exer select 1238 is broadcast 

?n?n k Z ? ^ 202 °- The bank ° f multi P' exers 2020 P«>°"<"s outputs to a second bank of multiplexers 
2030, whose other inputs are the constituent path metrics of upper path metric bus 1 296 and lower path metric bus 
1295. A forward trellis transparent bit 1236 is provided to the second bank of multiplexers 2030 and is used to effect 
a transparent path when the decoder 1200 is operating in the reverse trellis mode. The bank of multiplexers 2030 
produces path metrics 1285a...1285h. which are co.lated to form the forward trellis path metrics 1285, beingthe output 
of the Forward Address Processor 1 290. 

E2l?i *2?'. 1 7 "T? Comparator 1247 of Fi 9- 4 - when used in an eight row decoder configuration. The compa- 
rators 1 247 ir . each of the rows are interconnected via a bank of multiplexers 2110. Each multiplexer 2110 presents a 
n iltT ? ? a = 0 ^ es P. ondin 9 comparator 1247' in its corresponding decoder row. Pairs of comparators 1247" 
n r 1 S ' npUtS t0 ACS n ° de UnitS 142 °aM420b», 1420c" and 1420d", each of which spans two 

lH'SfS^ 1 71 5 ° f f/ Comparators 1 2471 are als ° P«««ed as recursive inputs to the bank of 
En.? f ? Zi t° S n ° deS URrtS 14203 1 1420b "' 1420c " and 1420d " are P aired and present their outputs as 
nputs to further ACS nodes units 1430a" and 1430b". The outputs of the ACS node units 1420a", 1420b", 1420c" and 

thl , ar< ; ^ P / eS , e ^f 38 reCUrSiVe inpUtS t0 thS bank 0f multiplexers 21 1 0. The ACS node units 1 430a" present 
her outputs to a final ACS node unit 1 440" and as recursive inputs to the bank of multiplexers 21 1 0. The output of the 
hnal ACS node unit 1440 is presented as a final recursive input to the bank of multiplexers 2110. Each multiplexer 
21 10 is presented with a select signal. luiupiwer 

1 aV 3] ♦<T i9 ' t 1 8 * howsthe confi S ur ation of the Input Symbol History 1298,. including an address controller of Fig 4 
An Inpu Symbol History Address 121 9 is presented as an input to a Window Decoder 221 0, which decodes the address 
to enable access to a first double buffered memory bank-0 221 6 and a second double buffered memory bank-1 221 8 

I^Z w ???J 29 l d0Ub ' e bUfferS reC8iVed inpUt 10 ensure tnat a continuous data flow is maintained. Input 

Symbol H.story clock 1223 and Input Symbol History reset 1225 are presented to a counter 2212, whose output 1297a 
is also presented to the double buffered memory bank-0 2216 and the double buffered memory bank-1 2218. Input 
SZSSaS? are r, r T« 6d , fr0m 3 h ° St Passer ^ a demultiplexer 2214. The demultiplexer 2214 produces an 
° ™ S ? f Sd mem0fV bank "° 2216 and the second out P ut 2226 to 3 double buffered memory bank- 

1 221 8. The demultiplexer 221 4 also receives as an input a read/write signal 1 297b. The read/write signal 1297b also 
feeds a first multiplexer 2220 and a second multiplexer 2222. Each of the double buffered memory banks 221 6 and 

h M S K P « Sen ! 3 R Se ' eCt Signa ' 1211 1 With the bank se,ect s, '9nal 1211 being inverted at the interface to 

double buffered memory bank-1 2218. 

[0 l 14 io^° Uble bUffSred mem0t)f bank "° 2216 P roduc as a first output 2228 to the multiplexer 2220 and a second 
output 2230 to a second multiplexer 2222. Double buffered memory bank-1 221 8 produces a corresponding first output 
2232 which feeds multiplexer 2220 and a second output 2234 which is presented to the second multiplexer 2222 The 
first mult.plexer 2220 produces an output 1291b which is presented as an input to LogLikelihood processor-0 1250b 
rr,, «f °2 mu,t, P ,exer2222 Produces an output 1 291 a which is presented as an input to the butterfly processors 1 26o' 
[0115] Fig. 18 also shows an exploded view of the double buffered memory bank-1 2218. Incoming data 2226 is 
presented to a 1 -to-n demultiplexer 2240, which also receives a window select, being the output of the window decode 
2210. N outputs from the demultiplexer 2240 are presented to n corresponding windows W0...Wn each of which 
produces an output which is presented to a first m-to-1 multiplexer 2242 and a second m-to-1 multiplexer 2244 Each 
of the m-to-1 multiplexers 2242, 2244 also receives a window select input signal. The first m-to-1 multiplexer 2242 
produces the output 2232 which is used for the dummy beta calculations and is destined for the LogLikelihood ratio 
processor-0 1250a. The second m-to-1 multiplexer 2244 produces the output 2234, which is used for calculating alphas 
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and betas in the branch metric units of the butterfly processors 1260. 

[0116] Fig. 19 shows the LogLikelihood ratio processor 1297 of Fig. 4. The LogLikelihood ratio processor 1 297 re- 
ceives inputs 1245a and 1245b, which are output from LogLikelihood processor-0 1250a and LogLikelihood processor- 
1 1250b, respectively. The LogLikelihood ratio processor 1 297 also receives as inputs the extrinsic information 1242, 
s the hard or soft output select 1213, Spreading Input 1 243, the Traceback Process Output 1567 and Scramble Address 
Data 1286. 

[0117] A subtracter 231 0 receives the inputs 1245a and 1245b, representing the likelihood of a "1" and a "0", re- 
spectively, and produces an output 2315 which feeds a second subtractor 2320. The output 2315 of the subtractor 
231 0 also feeds a first multiplexer 2330 and forms part of an output 1294. The second input to the subtractor 2320 is 

10 the extrinsic information 1242. The output 2325 of the subtractor 2320 is presented to a second multiplexer 2340. 
[01 18] The Traceback Process Output 1567 is presented as a second input to the first multiplexer 2330. The hard 
or soft output select 1213 is presented as the select input of the multiplexer 2330 and the output of the multiplexer 
2330 forms the zero bit of the decoded output 1 294. The output 231 5 of the subtractor 23 1 0 is combined with the least 
significant bit of the output of the multiplexer 2330 to form a multi-bit decoded output 1294. 

is [0119] The second multiplexer 2340 receives Scramble Address Data 1286 as its second input and Spreading Input 
1243 as its select signal. The second multiplexer 2340 produces an output 1293, which is fed from the LogLikelihood 
ratio processor 1 297 to the Intermediate Decoding Result and Memory 1 240. 

[01 20] The embodiment shown in Fig. 3 operates in a five-phase mode. As no loglikelihood processors are present, 
more path metric memory is required to store more alphas and betas in the computations performed by LogLikelihood 
20 Processors 1250a and 1250b in the embodiment of Fig. 4, which operates in a two-phase mode. 

Operation 

[0121] The first step In the operation of the decoder 1200 is to initialise the decoder such that the architecture em- 

25 bodies the required configuration of either convolutional decoding or turbo decoding. The variables available for ma- 
nipulation include the number of columns needed for the trellis size in question, the number of states in the trellis, the 
mask for the appropriate number of bits to be used in the addressing of the columns in the path metric memory and 
the decision depth of the traceback process. The register which holds the winning. path metric for the symbol being^ 
processed is initialised and sequential numbers are assigned to a register bank whose values are permuted between 

30 every symbol time to reflect the column address sequence required for each trellis operation. 

[0122] It is to be noted that the decoder 1200 can operate in either the forward or reverse trellis direction. 
[0123] In the case in which the trellis is being navigated in the forward direction, the Reverse Address Processor 
1 270 is configured to operate in transparent mode by setting the Reverse Trellis Transparent Bit 1 226. When navigating 
the trellis in the forward direction, the sequential numbers are rotated to the left after their first use. 

35 [0124] An iterative process begins by reading the path metrics from the column of the path metric store 1 280 B0 and 
B1 corresponding to the number of the iteration. The sequential list of path metrics held in the first column of 1280 B0 
and 1280 B1 are presented to the butterfly processors 1260. The butterfly processors 1260 produce, via the bank of 
multiplexers 1250c, new path metrics, which are no longer in sequential destination state order and are fed into the 
Forward Address Processor 1 290. The Forward Address Processor 1290 essentially performs a sort operation on each 

40 column of new path metrics with the resultant effect being that the columns in the path metrics memory 1 280 BO and 
B1 represent a set of sequential states when reading down the column. During each column operation, as shown in 
Figs 7A-E, half of the eight new path metrics are written directly into the path metric store 1 280, whilst the remaining 
new path metrics are written into the hold registers 2015a.. .2015d within the Forward Address Processor 1290. This 
alternates between each group of path metrics. 

45 [0125] The navigation through the forward trellis requires a number of column iterations, being one more than the 
number of columns needed for the particular trellis in question. If the number of iteration is even, path metrics from 
buses 1296A, C, E, G are written into the column of path metric store 1280 B0 corresponding to the number of the 
iteration. Path metrics from the buses 1296B, D, F, H are contemporaneously written into the hold registers 2015a... 
201 5d of the Forward Address Processor 1290. 

[0126] If, however, it is an odd iteration, the path metrics from buses 1296A, C, E, G are written into the hold registers 
2015a...2015d of the Forward Address Processor 1290 and path metrics from buses 1296B, D, F, H are written into 
the column of the path metric store 1280 corresponding to the number of the iteration. 

[0127] During the column operations, the decision bits 1255 generated by the ACS units of the butterfly processor 
1 260 are grouped into a byte and written into the Intermediate Decoding Memory and Processor 1 240. The next iteration 
55 in the process begins by reading the column address from the path metric store 1 280 B0 and B1 corresponding to the 
number of the next iteration. The iterative process continues until the number of column iterations corresponds to one 
more than the number of columns required for the trellis being calculated. 

[0128] A further write operation is required at the end of the iterative process to transfer the four new path metrics 
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2 nl h ° ld I* 9 *?' °' F0rW3rd Addr6SS Processor 1 290 - ™e *™ new path metrics are written into the final column 

t^s^'SsrsL s ■ The final resu * is that the new path mewcs have been written int ° p atn 

has no, changed ' " ^ H ° WeVer ' ' * t0 be n ° ted tnat the order within each column 

KLnll?^*!^" i8 ,' S bSin . 9 navi 9 ated in the reveree direction, the sequenUal numbers are rotated to the right 
and then used for the first time. A group of four path metrics are fetched from the first column of path metrics 1 280 BO 

?29oTco a nSrll e on °i 7 f erS Wi,Hin RSVerSe AddrSSS Pr0CeSS ° r 127 °- The Forward Add ^=s Processor 
In II 9 T t P m . a trans P arent mode b V setting the forward trellis transparent bit 1 236. The correspond- 
ng reverse trelhs transparent bit 1226 is set such that Reverse Address Processor 1270 is enabled The naSon 
through the reverse trellis is described in Figs 8A-8F. enaoieo. ne navigation 

10130] Navigating the trellis in the reverse direction requires a number of iterations corresponding to one more than 

° t ? r T ired Parti ° U,ar tr6 " iS - Wh6n naVi 9 ati "9 the tre,,is in the reverse diction theTn 

P 270 tfn^Zr! SV alwa ^P resente a ^rambled list of path metrics through the Reverse Address Processor 

produced by the butterfly processors 1260 is trellis state sequential 

Smo'^tK eVent J nat an even iteration is bei "9 undertaken, the column in the path metrics store 1280 B0 corre- 
SsoS 1278 and Z rI "'"o ° M " PaaSed thr0Ugh the multiplexers 1278a, normalising proo 

hi nth r !2 9 n AddreSS Processor 1270 10 *• butterfly processors 1260. The path metrics currently 
held in the Reverse Address Processor 1 270 are also read into the butterfly processor 1260. The column in path metric 

Pressor 1 ^70. * ** ° f ** fterati ° n * *"* ^ mt0 h °' d ° f th * ^"e^dd^ 

Sl^mhLVoftHf I r umb6rof tne iteration is odd . th e column of path metric store 1 280 B1 corresponding to 

27 8 Tth 6 ?JI! IT P o °" e IS ^ a " d PaSSed thr0Ugh tne multiplexers 1278a and normalising processors 
rZI ah? I « .I 0 ™ 830 ' 1 270 and the " 10 the butterf, y P rocessor 1 260 Th e P*h metric^ held in the 

mllrics Z I™**™ 127 °. are alS ° PreSentSd 38 inpUtS 10 thS bUtterf ^ processor 1260 - T "s -'umn of path 

^T^pSSSST 8 10 the number of the it6raaon is read and written int0 the hold re9ister 

52 m tl thiS t P ° int 0 L tbe D naVi9ati ° n °' reVerSe tre,lis ' th e sequential list of path metrics held in the first column 
PrcSssT Z Z* . and 81 iS PreSentSd t0 the ReVSree AddreSS Processor 127 °- Th « Averse Address 

l^ZZ 1 */k?T * °" C0 ' UrTln °' new path metncs t0 the effect th «t the resultant columns 

P 260 ^ 1 !• T V Pr ?. CeSS ° r 1 260 3re 00 '° n9er in Sequentia ' destination state order. The butterfly processor 
1 260 produces e.ght new path metncs, which are presented, via a bank of multiplexers 1 250c, to the Forward Address 
oZTZ 1290 - The Forward Address Processor 1290 is in transparent mode . so the tre.Hs-state J^SEES 
pa h me ncs produced by the butterfly processors 1260, via the bank of multiplexers 1250c. is written back into he 

rTd^ 

KSfL ™« h 6 f lum " epe^tions, the decision bits 1 255 generated by the ACS units of the butterfly processors 
1260 are grouped into a byte and presented to the Intermediate Decoding Memory and Processor 1240 The next 
teration commences by reading the appropriate column of path metrics from path metric stores 1 280 B0 and B1 

R1 »L£?1 IT thS iteratK,e Pr0CeSS ' th? " 6W path metrics are back in Path metrics store 1280 B0 and 

rnV, C i Tl > T m " ° rder U iS t0 bS " 0ted that the orderi ng within each column has not changed 
[0136] The traceback processor 1510 within the Intermediate Decoding Memory and Processor 1240 knows the 
ln!r!,t Pr0Ce ^ ln H d,recti0 " and the bit "^ion of the decision bit as it performs the well known pointer based traceback 

™^™f eC 'T u f ' S 6XtraCted fr ° m ° ne byte a0d iS US6d t0 9enera,e the ne)rt P oi ntar into the traceback 
TnZaZ It Z ITr ° „ te ™" ates w u hen a Predefined traceback depth has been achieved. The traceback depth is 
typically between five and nine times the constraint length of the code 

h™ JSTI ^ deC ° der 1200 iS bei " 9 US6d fprtUrt> ° deCOdinS " the Posing is broken into two distinct phases: 
dummy-beta/alpha processing and beta/LLR processing. When either the forward trellis or the reverse trellis operation 
is mentioned the above processing occurs, but only for the degenerate case of when the number of trellis states matches 
the number of ACS units ACS0...ACS7 in a multiple (power of 2) of the ACS unit size of the butterfly processed T 2 60 

^nltlf , ,^ 00 l PrOCeSSOr "° 12503 and ACS UnitS Within the butterfl y P^oessor 1260 are each equipped with 
rn?«f £ 8 res P ect,v e ACS units to accumulate results needed for alpha and beta calculations 

[0138] The calculation of dummy-betas and alphas occur in parallel. The LogLikelihood processor-0 1250a performs 
a dummy beta calculation using the leaf ACS units at its disposal. This calculation requires access to the input symbol 
history buffer and the Intermediate Decoding Memory and Processor interleaver memory, each of which is a windowed 
memory system. The .nput symbol history buffer is organised into banks 2216 and 2218 of the size of a processing 

oroSeTsed ft tffr F^TT* ' ^ aCCUmulates dumm y be tas by processing at time t the window to be 
processed at time t+1. The LogLikelihood processor-0 1250a does not need to access the path metric stores 1280 
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which is why the LogUkelihood ratio processor-0 1250a can operate in parallel to the ACS units contained within the 
Butterfly processors 1260. 

[0139] The LogUkelihood ratio processor-0 1250a performs normalisation on the dummy beta values by using the 
adders in the ACS tree to determine the maximum beta calculated. This value is then subtracted from the inputs to the 
leaf ACS units of the LogUkelihood ratio processor-0 1 250a before they are used. 

[0140] The butterfly processors 1260 perform alpha computations, accumulating alpha values in the registers con- 
tained within constituent ACS units. The butterfly processors 1260 perform the forward trellis operation and normali- 
sation as is usuai during the forward trellis navigation. 

[0141] The dummy betas calculated by the LogUkelihood ratio processor-0 1250a are presented to the butterfly 
processors 1260 at the start of the beta calculation phase. 

[01 42] During calculation of the betas, both LogUkelihood processors 1 250a and 1 250b are used in conjunction with 
the butterfly processors 1 260. Each of the LogUkelihood processors 1 250a, 1 250b accepts alphas from the path metric 
store 1 280, betas resulting from the previous clock cycle and extrinsic information 1 242 produced from the Intermediate 
Decoding Memory and Processor 1240 to create a LogLikelihood result for a "1" and "0", respectively. The LogLikeli- 
hood calculations can span multiple rows since they are determining the maximum result over all the states. 
[0143] Beta computations work in the reverse direction through the input symbol history window, compared to the 
alphas, and use gammas used in the alpha calculations. The beta computations use the same trellis branch metric 
assignments that were used for the alpha calculations. 

[0144] When the whole block of input history has been processed and the resultant outputs have been fed into the 
interieaver 1 520, the process is able to commence for the second half of the turbo decoder operation. The interleaver 
operation during first decoder operation is read sequentially and written sequentially. During second decoder operation, 
the interleaver is read from and written to, albeit using the random address sequence as determined by the scrambler 
address output. During second decoder operation, the read and write addresses are the same. The interleaver operation 
after the first decoder writes in sequentially and reads out randomly, as per the predefined spreading sequence which 
is used to give the first and second decoders their statistical independence. The interleaver operation for the second 
decoder writes randomly, as per the spreading sequence, and reads sequentially. 

[01 45] It is to be noted that because the encoders used for turbo encoding do not have to be the same, the decoding 
rates and constraints of the second decoder need not necessarily be the same as those for the first decoder. This may 
require that the configuration of the turbo decoder be changed between block processing operations. If this is the case, 
it is easily dealt with by manipulating the contents of the configuration registers. 

[01 46] Each block of input symbol history requires several complete turbo iterations in order to be decoded to within 
an acceptable bit error rate. The number of iterations required is configurable to ensure that the required bit error rate 
is achieved. 

[0147] A benefit of the architecture in question is that it only requires two phases to complete one turbo decode 
iteration. This provides flexibility in the use of the architecture and allows the number of decoder rows used to be traded 
for the number of Iterations required. For example, a turbo decoder that does four iterations may be implemented using 
two decoder rows requiring two iteration times. 

[0148] LogMAP computation is performed using a sliding window algorithm. The sliding window algorithm is imple- 
mented in 2 phases. In a single decoder this results in increased latency: 2 passes over each window as shown in the 
configuration (with only a single decoder being used) in Fig. 20A. The first pass computes the dummy beta values and 
the forward alpha values in parallel and stores the forward alpha values in the alpha memory (NOTE: this memory is 
the same memory as used in the Viterbi algorithm for path metric storage). The second pass reads the alpha values 
and computes the beta values according to the LogMAP algorithm and outputs LogLikelihood ratios (LLR). 
[0149] When multiple decoders are used, the computation of the two phases can be overlapped and the decoder 
can process a single block with reduced latency. Multiple decoders can operate separately on different data streams 
or they can co-operate to increase the decoding speed of a single stream, as shown in the configuration of Fig. 20B. 
The implementation shown in Fig. 3 can process 4 independent streams or 2 streams with increased speed (and 
reduced latency) or 1 stream with further increased speed (and minimal latency). 

[0150] Table 1 demonstrates the flexibility of the unified decoder to support multiple encoded streams simultaneously. 
For example, a decoder with 4 decoder rows can process up to 4 data streams at the same time. Furthermore, the 
decoder rows can operate together to decode fewer streams at higher throughput. This is useful for minimizing the 
latency of voice decoding. Table 1 demonstrates the flexibility of this approach and the appropriate decoding speed- 
up obtained in each case. (Again - this list is by no-means complete - more decoder rows can be connected together 
to achieve even greater flexibility.) 
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Example decoding configurations of multi-bank interconnected decoders. 


Scenario 


Decoder Configuration 


uecoaing opeea-up (over convolutional/ 
turbo on 1 decoder) 


1 convolutional 


4 decoders per stream 


4X (conv) 


2 convolutional 


2 decoders per stream 


2X (conv), 2X (conv) 


3 convolutional 


1 decoder for 2 streams, 2 decoders for I 
stream 


1X (conv), 1X (conv), 2X (conv) 


4 convolutional 


1 decoder per stream 


1X, 1X, 1X, 1X 


1 turbo 


2 decoders per stream 


2X (turbo) 


2 turbo 


1 decoder per stream 


1X, 1X 


4 turbo 


1 decoder per stream 


1X, 1X, 1X, 1X 


1 convolutional & 1 turbo 


1 decoder conv, 2 decoders turbo 


1X (conv), 2X (turbo) 



[0151] To demonstrate how 2 or 4 decoders can co-operate to decode fewer data streams at a higher speed Fiq 
21 shows the interconnections between two decoders. The boxes marked »M« are multiplexers that enable some of 
the path metncs from adjacent decoders to be swapped before writing to the path metric memories. In this manner 
he decoders can operate as a single decoder. Furthermore, Fig. 22 shows how 4 decoders can be interconnected to 
function as either a single decoder, two separate decoders, or 4 separate decoders 

^!f 21 . T ! d * monstrate the multi-standard nature of the unified decoder, the decoder can support any combination 
of the standards shown in Table 2. fThis list is by no means complete - but is included to demonstrate the flexible (and 
therefore useful) nature of this unified decoder). K 



Table 2: 



Example of standards supported by unified decoder. 



Standard 


Code Rate 


Constraint Length 


GSM - full-rate voice 




5 


GSM - half -rate voice 


1/3 


7 


GSM- data full-rate (9.6 Kbps) 




5 


GSM - data full rate (4.8 Kbps) 


1/3 


5 


GSM - data full rate (2.4 Kbps) 


1/6 


5 


GPRS - CS-1 




5 


EDGE - MCS (1-9) 


1/3 


7 


GSM-AMR TCH/AFS6;7 


% 


7 


GSM-AMR TCH/AFS5.1 5 


1/5 


7 


UMTS Voice (slotted) 




9 


UMTS Voice (normal) 


1/3 


9 


UMTS Data (Turbo) 


Vz 


4 


UMTS Data (Turbo) 


1/3 


4 


CDMA 2000 Voice 




9 


CDMA 2000 Voice 




9 


CDMA 2000 Data (Turbo) 




4 



[0153] The unified decoder 900 implements the decoding required for convolutional encoded and turbo encoded 
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data streams and can support multiple data streams and multiple voice streams simultaneously. When decoding Turbo- 
encoded data streams, this decoder implements an iterative Turbo decoder using either the MAX-LOG MAP or the 
LOG-MAP soft-output MAP algorithms. The decoder maximizes the re-use of its components to enable the efficient 
implementation of both convolutional and turbo decoding systems. 

5 [0154] The decoder can be dynamically partitioned, as required, to decode voice streams for different standards. 
The decoder can process streams with different coding rates (rate 14, rate 1/3, rate etc.). It can also process streams 
encoded with different constraint lengths. As such, the unified decoder architecture is capable of supporting each of 
the mobile wireless standards currently defined: first, second and third generation for both voice and data. 
[0155] The unified decoder architecture of the preferred embodiment encapsulates the functionality of nori-system- 

10 atic (feed forward) encoders and systematic encoders (feed backward) in a single architecture. Fig. 23A shows mixing 
of polynomials 3240 and state bits 3250 to produce a single code bit 3225_0 of a code word 3225. Polynomials 3240 
are presented to corresponding AND gates 3260, which also receive states 3250 as inputs. Each of the AND gates 
3260 produces an output to a corresponding XOR gate 3270. Each XOR gate 3270 also receives a TRANSITION 
INPUT 3280 and produces an output 3225_0 of the M-bit n on -systematic encoder 3230. 

is [01 56] Fig. 23B shows a whole encoder 3200 for a code word 3225. Polynomials 3240 are presented to corresponding 
M-bit non-systematic encoders 3230. An input bit 3220 is presented to an XOR gate 3275. A RSC_ENABLE signal is 
presented to an AND gate 3280, the output of which is the second input of the XOR gate 3275. The AND gate 3280 
also receives as an input the output of the encoders 3230. The XOR gate 3275 presents an output to an M-bit shift 
register 3210 and to each of the encoders 3230. The M-Bit Shift Register 321 0 also receives a clock signal 3285 and 

20 a reset signal 3290 and holds a state of the encoder 3200 at a time T. The state value is used in conjunction with each 
particular polynomial 3240 (as specified by a particular code) to produce a ho n -systematic code bit. The output 3250 
of the register 321 0 is broadcast to each of the encoders 3230. The outputs 3225_0...3225_R of the encoder 3230 are 
collated to form the CODEJ/VORD 3225. 

[0157] By enabling the RSC_ENABLE 3215, the encoder 3200 becomes a recursive, systematic (RS) encoder. In a 
25 recursive, systematic code, the input bit 3220 forms the systematic bit of a code word 3225. The generated bits of each 
M-Bit Encoder 3230 form the remainder of the RS code word 3225. 

[0158] In the case of a non-systematic encoder the CODE_WORD 3225 would contain R bits (where R = the rate 
of the code). When the RSC_ENABLE 3215 is active, the CODE_WORD 3225 is typically 1-bit wide. The output 
CODE_WORD 3225 (in this case 1-bit wide) and the INPUT_BlT 3220 form the RS code word. 
30 [01 59] It is apparent from the above that the embodiment(s) of the invention are applicable to the decoding of multiple 
wireless transmission standards using a unified, scalable architecture. . 

[01 60] The foregoing describes only one embodiment/some embodiments of the present invention, and modifications 
and/or changes can be made thereto without departing from the scope of the invention, the embodiment(s) being 
illustrative and not restrictive. 
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Claims 

1 . Butterfly processor arrangement for use in telecommunications decoding, said arrangement comprising: 



first and second add-compare-select modules, said modules each receiving first and second input path metrics 
supplied to said arrangement and wherein each said module comprises log-sum correction means coupled 
to compare and select components of said module and a controllable switch for selectively coupling outputs 
of said select component and said log-sum correction means to an output of said module; 

45 a branch metric calculator, said branch metric calculator receiving input data and extrinsic data and arranged 

to generate first and second output branch metrics supplied to each of said add-compare-select modules, said 
second output branch metric being the arithmetic inverse of the first output branch metric; and 
means for actuating said switch such that when said select component is output, said arrangement operates 
for convolutional decoding and when said log-sum correction means is output said arrangement operates for 

so LOGMAP decoding. 

2. An arrangement according to claim 1 , wherein said log-sum correction means comprises a log-sum correction 
table having an address input connected to a difference output of said compare component, said table outputting 
a log-sum value to a first input of an adder, a second input of which being coupled to the output of the select 

55 component, said adder providing an output of said log-sum correction means. 

3. A butterfly processor arrangement for use in telecommunications decoding, said arrangement comprising: 
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first and second add-compare-select means, each said add-compare-select means for receiving first and sec- 
ond input path metrics supplied to said arrangement and wherein each said add-compare-select means in- 
cludes log-sum correction means coupled to compare and select components of said add-compare-select 
means; 

a controllable switch means for selectively coupling outputs of said select component and said log-sum cor- 
rection means to an output of said add-compare-select means; 

branch metric calculator means for receiving input data and extrinsic data and arranged to generate first and 
second output branch metrics supplied to each of said add^ompare-select means, said second output branch 
metric being the arithmetic inverse of the first output branch metric; and 

means for actuating said switch means such that when said select component is output, said arrangement 
operates for convolutiona! decoding and when said log-sum correction means is output said arrangement 
operates for LOGMAP decoding. 

An arrangement according to claim 1, wherein said, log-sum correction means comprises a log-sum correction 
table means having an address input connected to a difference output of said compare component, said table 
means outputting a log-sum value to a first input of an adder means, a second input of which being coupled to the 
output of the select component, said adder means providing an output of said log-sum correction means. 

A method for performing butterfly processing in telecommunications decoding, said method comprising the steps of: 

generating first and second branch metrics from input data and extrinsic data, said second branch metric being 
the arithmetic inverse of said first branch metric; 

performing add-compare-select operations on first and second input path metrics and said first and second 
branch metrics; 

log-sum correcting a select operation output to produce a log-sum corrected output; and 
selectively coupling said log-sum corrected output and said select operation output to produce an output; 
whereby when said select operation output is output, said butterfly processing operates for convolutional de- 
coding and when said iog-sum corrected output is output, said butterfly processing operates for LOGMAP 
decoding. 

The method according to claim.5, whereby said log-sum correcting utilises a log-sum correction table, the method 
comprising the further steps of : 

providing a difference output of said compare operation to an address input of said log-sum correction table 
to output a corresponding log-sum value; and 

adding said iog-sum value to said select operation output to provide said log-sum corrected output. 
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